CRL/BRANDEIS: The Diderot System
نویسندگان
چکیده
Because of the emphasis on different languages and different subject areas the research has focused on the development of general purpose, re-usable techniques. The CRL/Brandeis group have implemented statistical methods for focusing on the relevant parts of texts, programs which recognize and mark names of people, places and organizations and also dates. The actual analysis of the critical parts of the texts is carried out by a parser controlled by lexical structures for the 'key' words in the text. To extend the system's coverage of English and Japanese some of the content of these lexical structures was derived from machine readable dictionaries. These were then enhanced with information extracted from corpora.
منابع مشابه
CRL/Brandeis: description of the Diderot system as used for MUC-5
This report describes the major developments over the last six months in completing th e Diderot information extraction system for the MUC-5 evaluation . Diderot is an information extraction system built at CRL and Brandeis University over th e past two years. It was produced as part of our efforts in the Tipster project . The same overall system architecture has been used for English and Japan...
متن کاملThe Diderot Information Extraction System
Diderot is an information extraction system built at CRL and Brandeis University over the past year. It was produced as part of our eeorts in the Tipster project. Diderot has already been converted from one subject domain to another and versions of the system have been made for two languages. The same system architecture has been used for English and Japanese and a comparison is made of the pro...
متن کاملCRL/NMSU and Brandeis MucBruce: MUC-4 test results and analysis
The Computing Research Laboratory (New Mexico State University) and the Computer Science Departmen t (Brandeis University) are collaborating on the development of a system (DIDEROT) to perform data extraction for the Tipster project . This system is still far from fully developed, but as many of the techniques being used are domain —and in many cases language— independent, we have assembled the...
متن کاملCRL/NMSU and Brandeis: description of the MucBruce system as used for MUC-4
Through their involvement in the Tipster project the Computing Research Laboratory at New Mexic o State University and the Computer Science Department at Brandeis University are developing a method fo r identifying articles of interest and extracting and storing specific kinds of information from large volumes o f Japanese and English texts . We intend that the method be general and extensible ...
متن کاملSex differentiation in goat fetus
Reproduction in domestic animals, as a major source of food and other products for human, has greatimportance and study of related subjects including sex differentiation and gonadogenesis during fetal life can solve many questions on normal development and various disorders of urogenital system. Since studies on sex differentiation in goat fetus are scarce, this study was performed. Twenty-five...
متن کامل